System, method, and computer readable medium for creating a video clip

ABSTRACT

The present invention provides a system, method, and computer readable medium for creating a video clip. In one embodiment, a method comprises creating a copy of a still image by a first module, the copy of the still image comprising reduced dimensions of the still image, creating a new still image from a selected area of the still image, automatically ordering the new still image and the copy of the still image, creating an audio file by at least one of a second module and a third module, creating a timeline, by the first module, related to the ordered images and the created audio file, and rendering the timeline into a video clip by a fourth module, wherein the timeline includes a length of the audio file, a length of the video clip, the ordered images, a display time of each of the ordered images, transition identifiers, and transition lengths. The present invention further discloses a system for assembling and distributing multi-media output, comprising: a rendering server; a web server; and storage, wherein the servers and the storage are operably coupled; the storage adapted to receive digital media and properties of the media, store the media and the properties, and transmit the media and the properties; the web server adapted to perform at least one of a following action: retrieve the media and properties of the media; manipulate the media and the properties; assemble the properties; and transmit at least one of a following element from a group consisting of: the properties; and the assembled properties; and the rendering server adapted to receive commands from the web server.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present invention is a continuation of and claims the benefit and priority of U.S. patent application Ser. No. 11/293,005, filed on Dec. 2, 2005, entitled System, Method, and Computer Readable Medium for Creating A Video Clip, which is related to and claims the benefit and priority of U.S. Provisional Patent Application Ser. No. 60/632,410, filed on Dec. 2, 2004, entitled Picture To Video (Pic-2-Vid) In Internet Commerce, and which is a Continuation-In-Part of U.S. patent application Ser. No. 10/773,130, filed on Feb. 5, 2004, entitled System And Method For Assembling And Distributing Multi-Media Output, which is related to and claims the benefit and priority of U.S. Provisional Patent Application Ser. No. 60/445,261, filed on Feb. 5, 2003, entitled System And Method For Generating A Unified File Name. The entire contents of each of the above noted patent applications are incorporated by reference herein.

FIELD OF THE INVENTION

The present invention is generally related to video and, more specifically, to a system, method, and computer readable medium for creating a video clip. The present invention also relates generally to multi-media content and, more specifically, to a system, method, and computer-readable medium for assembling and distributing multi-media output.

BACKGROUND OF THE INVENTION

The transcoding of pictures to video is a known technique; U.S. Pat. No. 6,195,503, for example, describes the transformation of compressed images and audio into video. Video editing using server-based Internet tools, including the transcoding and rendering of pictures into video, is also known. However, a limitation exists for using pictures as a tool for Internet commerce. Almost all sites for selling and buying items on the Internet use simple pictures as the main tool for explaining, demonstrating and educating customers about these items. Pictures are simple to take and use. Further, pictures do not require complex editing tools or processing power, and do not require a high level of expertise to take, process, or deliver.

It would be very beneficial if improved systems, methods, and computer readable media were devised that allow better options to convey pertinent information while using the same pictures that are used today.

Large numbers of organizations are producing and making use of video, audio, flash animation, HTML and pictures, collectively known as Multi Media Content (MMC). There is also an abundant amount of video and audio in analog format (such as tapes), which is typically converted to digital format. Another major source of MMC is commercial material produced by the entertainment (movie studios) and broadcasting (TV) industries, as well as by individuals using camcorders. Most of the MMC is distributed on CDs and DVDs. Production of such media is costly and distribution via the mail system is time consuming.

An alternative to DVDs and CDs is electronic distribution, which can be accomplished via a Local Area Network (LAN), a Wide Area Network (WAN), using TCP/IP via a public network (the Internet), or via an internal system (Intranet). Other means of distribution are wireless, such as microwave, a cellular network, and a Wi-Fi network, for example. However, MMC (especially video) typically comprises large files, and distributing such content electronically can be very expensive, time consuming, and in many cases simply impossible due to the limited capacity of the receiving device.

Trading, licensing and selling of MMC by commercial providers (such as movie studios, TV networks, sport channels, etc.) is cumbersome since the providers may consider the content to be proprietary and may find it difficult to prevent a receiver of the content from creating multiple copies.

Progressive download, widely known as Streaming Media (a client-server system), is an excellent solution since the encoding process reduces the original file size by 80-90%. Upon request, the server sends a small amount of data ("buffering"). As soon as the buffering is completed, the receiving device starts the playback while the process of downloading and decoding continues in the background, often simultaneously. The process of encoding MMC to a streaming format, however, is cumbersome, time consuming, and requires significant technical expertise, as the user has to select from a wide range of parameters. Furthermore, the nature of TCP/IP and a secured network blocks the user from direct access to the operating system and file storage process. A separate uploading process is required, and the final stage of storage and indexing for retrieval must be done by authorized personnel (for example, a system administrator).

Other issues that prevent wide use of MMC content include:

1. Once the MMC is uploaded it cannot be changed—any change requires creation of a new file (rendering) and repeating the upload process;

2. Streaming video can be played within flash and HTML, but there is no way to tell what and when the receiving device will play each component since the buffering time can change randomly; and

3. There are many types of receiving devices using many communication protocols, players and streaming technologies.

Distributing MMC in streaming format also enables the MMC provider to license the use of content without proprietary concern, since the progressive download process prevents copying. However, establishing a commerce platform for licensing and trading MMC requires an agreed-upon protocol and a large, centralized database to monitor the transactions. Many attempts to do so have failed. It is therefore desirable for the present invention to overcome the aforementioned problems and limitations associated with multi-media output.

SUMMARY OF THE INVENTION

The present invention provides a system, method, and computer readable medium for creating a video clip. In one embodiment, a method comprises creating a copy of a still image by a first module, the copy of the still image comprising reduced dimensions of the still image, creating a new still image from a selected area of the still image, automatically ordering the new still image and the copy of the still image, creating an audio file by at least one of a second module and a third module, creating a timeline, by the first module, related to the ordered images and the created audio file, and rendering the timeline into a video clip by a fourth module, wherein the timeline includes a length of the audio file, a length of the video clip, the ordered images, a display time of each of the ordered images, transition identifiers, and transition lengths.

In another embodiment, a system for creating a video clip comprises a device configured to create an audio file, and a web server configured to create a copy of a still image, a new still image from a selected area of the still image, wherein the new still image and the copy of the still image are automatically ordered, and a timeline related to the ordered images and the created audio file, wherein the timeline includes a length of the audio file.

In a further embodiment, a computer readable medium comprises instructions for creating a copy of a still image by a first module, the copy of the still image comprising reduced dimensions of the still image, creating a new still image from a selected area of the still image, automatically ordering the new still image and the copy of the still image, creating an audio file by at least one of a second module and a third module, creating a timeline, by the first module, related to the ordered images and the created audio, and rendering the timeline into a video clip by a fourth module, wherein the timeline includes a length of the audio file, an identifier of the audio file created by the second module or the third module, or a volume envelope associated with the audio file.

The present invention also achieves technical advantages as a system, method and computer-readable medium for assembling and distributing multi-media output. Various embodiments of the present invention are noted below:

1. Allow any user a simple method of encoding and uploading. This can be done by setting pre-defined "profiles" containing specific parameters for encoding, indexing, sorting and uploading any type of MMC. The profiles can be created by a system administrator, for example, and stored on a server. The user computing device automatically downloads these profiles.

2. The user is able to select any type of content. The system is able to allocate the right encoding and compressing process for each type of content. Setting the content attributes, indexing and encoding are done on the user's computing device. The uploaded MMC will then be much smaller than the original MMC, thus saving significant upload time. Since the MMC has been identified, described and indexed, the content can be automatically directed to the right storage device. Retrieval by indexing, attribute and keyword search is enabled. Once stored on the server, the content can be instantly edited by selecting entry and exit points for the streaming server.

3. The user can also select separate files and/or segments to be played together as one show ("movie making").

4. Allow the user to add voice to the MMC by means of a telephone, cellular phone, microphone and other similar devices. The user is able to play the voice over while the MMC is played or as an introduction before the MMC. The user should also be able to control the volume setting of the audio channels.

5. The user is able to mix and integrate different types of MMC such as video, audio, animation and pictures instantly and without rendering a new file. Since the system stores only the instruction sets and the server produces the edited clips, made-up movies and customized productions on the fly, only a fraction of the storage capacity is required. The server creates Multi Media Presentations (MMP) that are displayed using an HTML based platform and Multi Media Messages (MMS) that are displayed directly on the device (for example, wireless devices such as cell phones).

6. Organizations are able to create and store pre-defined templates allowing their users to change the MMC content, add text, animation and voice over as needed.

7. The user is able to distribute the MMC in many formats such as:

E-mail with a link to the message in HTML format (Vid-Mail);

Independent web site;

Embedded object in a web site;

Multi Media Message (MMS) to cell phone and wireless devices; and

On line instant messaging systems.

8. Security and access levels are built into the system such that user access to the MMC is controlled. Security features are enabled for the MMS and MMP as well, such that certain clips will not play for unauthorized viewers.

9. The system has the ability to automatically attach other MMC, such as advertisements and sponsors' messages, to any MMP and MMS. This process is known as "wrapping" and can be done on a random basis or triggered by external parameters (such as demographic targeted wrappers).

10. The integration and execution of commands between different media types (such as streaming and flash) can be controlled and modified even after publication. Viewers can interact with the control system via a computing device or any telephone.

11. The system permits copying and sharing content between different projects and storage devices based on the user's access level.

12. Allow for search and retrieval of MMC based on a unique identifier, indexing system and keyword search. The search and retrieval is machine independent and does not require any specific database and/or synchronization.

13. By defining commerce criteria, such as pricing, duration of license and time limits, one can offer the MMC for trade without copying and downloading the MMC (thus protecting intellectual property). The process of such trade is independent—one can define the terms of commerce and exchange confirmation without any predefined protocol and/or centralized system.

14. Using the MMP and MMS command set stored on the server or on the user's computer, a new file can be rendered in the background. The new file is seamless, contains all the elements of the MMP or MMS, and can include special effects, transitions, embedded text, etc. The rendered file can be stored on the server as a new MMC. The rendered file can also be sent via MMS or downloaded to the user.

15. The command set can be sent directly to any video editing system as a "storyboard". The editing system automatically loads the right clips at the right places and times for the video editor to complete the editing process. A tremendous amount of time is saved and the communication between the parties is much more effective.

In one embodiment is a system for assembling and distributing multi-media output which comprises: a rendering server; a web server; and storage, wherein the servers and the storage are operably coupled; the storage adapted to receive digital media and properties of the media, store the media and the properties, and transmit the media and the properties; the web server adapted to perform at least one of a following action: retrieve the media and properties of the media; manipulate the media and the properties; assemble the properties; and transmit at least one of a following element from a group consisting of: the properties; and the assembled properties; the rendering server adapted to receive commands from the web server. In another embodiment is a method for creating a unified file name, which comprises: assigning a unique identifier based on a destination of a file; assigning a code based on a type of the file after the unique identifier; assigning a code based on a user defined category after the code based on the file type; assigning a code based on a user defined sub-category after the code based on the user defined category; assigning a code related to at least one of: a creator of the file; and a creator of a content of the file, after the code based on the user defined sub-category; and assigning a creation date of at least one of: the creator of the file; and the creator of the content of the file, after the previously assigned code. In a further embodiment is a computer readable medium which comprises instructions for: indicating, via a first instruction, a time index within a multi-media output; indicating, via a second instruction, a file within the multi-media output; playing the multi-media output via a first player; receiving an audio file at a second player; buffering the audio file at the second player; and playing the buffered audio file during at least one of a following location: the time index at the first player; and at a point the file is encountered at the first player.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a system for creating a video clip in accordance with a preferred embodiment of the present invention;

FIG. 2 depicts a more detailed system for creating a video clip in accordance with a preferred embodiment of the present invention;

FIG. 3 depicts a flowchart for synchronization in accordance with a preferred embodiment of the present invention;

FIG. 4 depicts a timer process in accordance with a preferred embodiment of the present invention;

FIG. 5 depicts a transcoding/rendering process in accordance with a preferred embodiment of the present invention;

FIG. 6 depicts a user interface for management of uploading images into the system in accordance with a preferred embodiment of the present invention;

FIG. 7 depicts a user interface for uploading images from a user's computer into the system in accordance with a preferred embodiment of the present invention;

FIG. 8 depicts a simpler user interface to upload images into the system in accordance with a preferred embodiment of the present invention;

FIG. 9 depicts a "mixer" user interface where an end user can assemble a video production in accordance with a preferred embodiment of the present invention;

FIG. 10 depicts a video transition chooser interface in accordance with a preferred embodiment of the present invention;

FIG. 11 depicts a click or context menu that is displayed when a user clicks on a thumbnail image in accordance with a preferred embodiment of the present invention;

FIG. 12 depicts a "Pan and Zoom" interface window in accordance with a preferred embodiment of the present invention;

FIG. 13 depicts a "Crop Image" interface window in accordance with a preferred embodiment of the present invention;

FIG. 14 depicts a male voice text-to-speech input interface in accordance with a preferred embodiment of the present invention;

FIG. 15 depicts a voice recording interface window in accordance with a preferred embodiment of the present invention;

FIG. 16 depicts a "Workbench" user interface window in accordance with a preferred embodiment of the present invention;

FIG. 17 illustrates an architecture in accordance with an exemplary embodiment of the present invention;

FIG. 18 illustrates receiving and play back in accordance with an exemplary embodiment of the present invention;

FIG. 19 illustrates an uploader in accordance with an exemplary embodiment of the present invention;

FIG. 20a illustrates a screen shot of an uploader login screen in accordance with an exemplary embodiment of the present invention;

FIG. 20b illustrates a screen shot of a select media screen in accordance with an exemplary embodiment of the present invention;

FIG. 20c illustrates a screen shot of a thumbnail creator in accordance with an exemplary embodiment of the present invention;

FIG. 20d illustrates a screen shot of an encode and upload content screen in accordance with an exemplary embodiment of the present invention;

FIG. 21 illustrates a unified file name in accordance with an exemplary embodiment of the present invention;

FIG. 22 illustrates a screen shot of the unified file naming selection of a UFN field in accordance with an exemplary embodiment of the present invention;

FIGS. 23a and 23b illustrate a storage and unified filing system in accordance with an exemplary embodiment of the present invention;

FIG. 24 illustrates a system design in accordance with an exemplary embodiment of the present invention;

FIG. 25 illustrates the process that the owner of digital media uses to publish the media along with its terms;

FIG. 26a illustrates a screen shot of a sister exchange exporting digital media in accordance with an exemplary embodiment of the present invention;

FIG. 26b illustrates a screen shot of sister exchange UFN fields in accordance with an exemplary embodiment of the present invention;

FIG. 27 illustrates selections from a user to populate the SISController in accordance with an exemplary embodiment of the present invention;

FIG. 28 illustrates a SISController in accordance with an exemplary embodiment of the present invention;

FIG. 29a illustrates screen shots of multi-media presentation creation: selecting media, in accordance with an exemplary embodiment of the present invention;

FIG. 29b illustrates screen shots of multi-media presentation creation: selecting destination options, in accordance with an exemplary embodiment of the present invention;

FIG. 29c illustrates screen shots of multi-media presentation creation: sending/saving the multi-media presentation, in accordance with an exemplary embodiment of the present invention;

FIG. 30a illustrates a screen shot of editing a clip in accordance with an exemplary embodiment of the present invention;

FIG. 30b illustrates a screen shot of sequencing clips in accordance with an exemplary embodiment of the present invention;

FIG. 31a illustrates an alternate process controlled by a user interface;

FIG. 31b illustrates an alternate process controlled by a user interface;

FIG. 32 illustrates a voice over in accordance with an exemplary embodiment of the present invention;

FIG. 33a illustrates a screen shot of the voice over application's phone line monitor in accordance with an exemplary embodiment of the present invention;

FIG. 33b illustrates a screen shot of the voice over application's media encoding settings in accordance with an exemplary embodiment of the present invention;

FIGS. 34a and 34b illustrate a sample of receiving a multi-media presentation in accordance with an exemplary embodiment of the present invention;

FIG. 35a illustrates the process flow to trigger animation events;

FIG. 35b illustrates a command to an animation file to play the ending animation;

FIG. 36 illustrates an MMS process flow in accordance with an exemplary embodiment of the present invention;

FIG. 37 illustrates a SISCommand instruction flow and sample SISCommand instructions in accordance with an exemplary embodiment of the present invention;

FIG. 38 illustrates a rendering server flow in accordance with an exemplary embodiment of the present invention; and

FIG. 39 illustrates an M-GEN in accordance with an exemplary embodiment of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention describes a process of creating video clips from any given set of digital pictures and of creating an audio track in either an automatic mode or a manual mode that is interchangeable. The pictures are provided as a set of Universal Resource Locators (URLs, pointing to one or more servers), uploaded manually, or created online by cropping and saving many pictures from one source. The audio track can be created from several sources, including:

Interactive Voice Response (IVR)—using any telephone system, the user can call into a computer based telephony application that calls a WMA encoder;

Text-To-Speech (TTS)—a TTS module can be used to convert text to speech;

Microphone—connected to a client device, which records a voice file on the server; and

Existing audio file—usually pre-recorded audio such as music or sound effects.

In manual mode, a preview module synchronizes a visual display (play-list) and an audio track for instant replay. A rendering server receives a command set (or SIS Commands) from a mixer or a predefined set of rules (in automatic mode) and creates a video file, in .WMV format, for example. An embedded player recognizes the existence of the .WMV file and plays the video, thus eliminating the need to post new HTML code.

The present invention further describes processing and delivering pictures made to video in a secure manner for use in commerce, such as for use in Internet commerce. Referring now to FIG. 1, a system 10 of the present invention comprises a first module (such as a web server) 14, at least one of: a second module (such as a TTS engine) 16 and a third module (such as an IVR engine) 18, a fourth module (such as a rendering server) 20, a fifth module (such as a media streaming server) 22, and memory (not shown). The system 10 is accessed by a user device 12 such as a computer, cellular phone, and/or any wireless or wired device capable of communicating with the system 10. The user device 12 is coupled to the first module 14, which is coupled to the second module and/or the third module directly or via the memory. The first module 14 is coupled to the fourth module 20 and is optionally coupled to the fifth module 22. In other embodiments, a lesser or greater number of modules can be used, and the functionality performed by the current invention can be performed by these lesser or greater numbers of modules without departing from the scope of the present invention. Also, the connections between the modules may be wired, wireless, or a combination of wired and wireless.

Editing Process

In one embodiment of the invention, the user uploads images (pictures), preferably in a compressed format (such as JPEG), to a dedicated server for editing and transcoding from pictures and audio tracks to video, preferably WMV. A main portion of the invention is the editing process. Instead of transforming the pictures to video and then editing, the software of the present invention prepares (edits) the video using the original pictures and the original sound track (audio). This method allows for very simple editing that is extremely light in processing power. The editing process, as will be explained later, does not require the actual transcoding process to take place until the editing is completed and the editing instructions are summarized in a special command file (a SIS Command) using a special command language. Instead of requiring multiple servers dedicated to the editing process (because of the requirement for processing power), the system uses fewer and lighter processing engines. This method also allows a streamlining of the transcoding process (transforming the pictures and associated audio to actual video). Once the user finishes editing the pictures and associated audio, the file containing the instructions on how to use the pictures and audio in the transcoding process is queued for the transcoding process. This achieves two main goals: (1) very efficient use of the transcoding engine, which typically requires most of the processing power, as files are processed only once, and (2) the files are queued (video encoding) in real time on their way to their final destination (the streaming server), as depicted in FIG. 5.

In one embodiment of the present invention, the system uses two players, one for audio and one for pictures. The editing process is premised on creating synchronization points between the two multimedia files (as depicted in FIGS. 3 and 4). In one instance, the pictures are displayed evenly in time according to the length of the audio file. For example, if there are 10 pictures to be transformed to video and the length of the audio file is 45 seconds, then each picture will be displayed for approximately 4.5 seconds. In another instance, the user (who created the video) can decide on different allocations of time to each picture based on the content of the audio file. This can be done either interactively (the user clicks the mouse, for example, when he or she desires a transition to the next picture) or by setting the time allocation manually—in percentages, for example.
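Below is a minimal sketch, in Python, of the even time-allocation rule just described; the function name is hypothetical, since the text does not prescribe an implementation:

    # Hypothetical sketch: divide the audio track's length evenly among the pictures.
    def allocate_display_times(picture_count: int, audio_length_s: float) -> list[float]:
        if picture_count <= 0:
            raise ValueError("at least one picture is required")
        return [audio_length_s / picture_count] * picture_count

    # 10 pictures over a 45-second audio track -> 4.5 seconds per picture.
    print(allocate_display_times(10, 45.0))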

In another embodiment of the present invention, the audio track for the video is obtained using a telephone. The user calls a number, dials in an identification number of the work in progress, and the server digitizes and compresses the incoming "audio track" in preparation for the editing process. In yet another variant of the invention, the audio track is uploaded from the user's computer, where he/she digitizes the audio locally.

Text-To-Speech

In another embodiment of the invention, which can be used in many other applications (beyond Internet commerce), TTS is integrated into the system. This is important for both occasional users as well as aggregators (aggregators assist clients who are large sellers (Power Sellers) selling large quantities of items). The occasional user can benefit from TTS because it provides a professional sounding voice to the video (and the user can choose from a variety of voice options). The aggregator (and by extension his customers) can benefit from TTS because it can automatically create large amounts of sound tracks for many video clips (without the need for human interaction).

In a further embodiment of the invention, every picture has its associated TTS and the switch to the next picture is performed when the paragraph is over. In yet another embodiment of the current invention, the user can inject quiet periods when needed in order to keep a picture with short text on the screen longer. Likewise, the user can also include music or any other external audio clips to supplement the final video.
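As an illustration of this per-picture timing rule, the sketch below assumes a hypothetical tts_length() helper that returns the duration, in seconds, of the speech synthesized from one paragraph:

    # Hypothetical sketch: each picture stays on screen for the length of its
    # spoken paragraph, plus any user-injected quiet period.
    def per_picture_durations(paragraphs, tts_length, quiet_padding_s=0.0):
        return [tts_length(text) + quiet_padding_s for text in paragraphs]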

Focal Points

In yet another element of the present invention, the high resolution of available pictures allows the user to create much higher quality video by using what are referred to as focal points. In many cases a single picture contains several objects that can be listed for sale. For example, a user may want to sell 10 tools and has created a single high resolution picture of all 10 tools. By using 10 focal points on the image (for example, selecting the focal points by clicks of the mouse), the software of the current invention can create the following: (1) 10 different pictures, and (2) a "real" video in which the "camera" fluidly moves from one object to the next. This is especially useful when there are connections from one object to the next. A user can add pop-up messages and possibly associated text to help in the "selling" process. Such pop-up windows can create the mood for the visualization (add humor) or put a special slant on the object that is easier to achieve with a pop-up picture (typically smaller) than with the audio track.
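The sketch below illustrates one way the focal-point idea could work, assuming the Pillow imaging library and hypothetical function names; it derives one cropped picture per focal point and a linearly interpolated "camera" path between consecutive points:

    from PIL import Image  # assumes the Pillow library is available

    # Hypothetical sketch: one cropped picture per focal point (center coordinates).
    def crop_focal_points(image_path, focal_points, crop_w=640, crop_h=480):
        image = Image.open(image_path)
        crops = []
        for cx, cy in focal_points:
            left = max(0, min(cx - crop_w // 2, image.width - crop_w))
            top = max(0, min(cy - crop_h // 2, image.height - crop_h))
            crops.append(image.crop((left, top, left + crop_w, top + crop_h)))
        return crops

    # Hypothetical sketch: interpolate "camera" centers so the video glides
    # fluidly from one object to the next.
    def camera_path(focal_points, steps_between=24):
        path = []
        for (x0, y0), (x1, y1) in zip(focal_points, focal_points[1:]):
            for i in range(steps_between):
                t = i / steps_between
                path.append((x0 + (x1 - x0) * t, y0 + (y1 - y0) * t))
        return path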

Picture Security

One of the best ways to sell on the Internet is by using high quality pictures. These pictures often take a great deal of effort to create. Unfortunately, once the pictures are on the Internet they are easy to copy and use. There are no good methods to protect pictures. In one embodiment of the invention, the pictures are transformed to video to facilitate protection. There are many video formats that support DRM (Digital Rights Management), which provides the protection of the pictures that the user worked hard to create. In one embodiment of the invention, WMV with its DRM system is used for picture protection.

The present invention includes a number of features:

1. Picture to video editing that uses the original pictures and audio tracks in the editing process.

2. (1) when two players are used for the editing process.

3. (1) where the transcoding process is streamlined.

4. Making the video playing length a function of the audio track length.

5. (4) where time allocated to each picture is equal.

6. (4) where the time allocated to each picture is manually set by the user.

7. Using a telephone to create the audio track.

8. Using text to speech as a source for the sound track.

9. (8) where every picture has its own text and the duration of the time allocated to each picture is a function of the length of the speech resulting from the text.

10. (8) where automatic picture to video is created for multiple streamlined advertisements.

11. (10) directed specifically at aggregators.

12. (1) or in (10) that facilitates the insertion of external audio clips and music into the picture to video process.

13. Use of focal points to improve the quality of the video.

14. A picture to video method in conjunction with a video DRM to protect the pictures from illegal distribution.

Referring now to FIG. 2, a more detailed system 30 of the present invention includes a user device 32, which may be a wired or wireless device, adapted to send pictures 34 via a manual process 36 or an automated process 38. If a manual process 36 is used, the pictures are received by a mixer 40, the pictures are uploaded 42 to a web server, and can be created 44 and arranged 46 or ordered, panned, or zoomed via a GUI accessible by the user device 32. A voice can be recorded via an IVR recording 48 or a TTS application 50, and music can be added 52 via a music interface. The audio and video can then be previewed 54. If the automated process 38 is used, a pre-defined destination 56 and pre-defined commands 58 perform the uploading, creating, editing, etc. that was previously described in the manual process. Audio and music can also be added via the TTS application 50 and the music interface 52, respectively. In either process 36, 38, a rendering server 60 provides a rendering process, further described in FIG. 5, which results in a video clip that can be stored 62.

Referring now to FIG. 3, a synchronization process 70 of the present invention is depicted. The process includes retrieving a URL for a first player 72, setting the first player to the retrieved URL (first URL) 74, retrieving a URL for a second player 76, setting the second player to the retrieved URL (second URL) 78, starting a timer process 80 (that is more fully described in FIG. 4), and signaling the first player to begin playing 82. This process can be used to preview pictures in one media player and audio in another player, or both media and audio in one player, with the pictures and the audio synchronized.
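A compact sketch of this flow, with hypothetical player and timer abstractions standing in for the actual media players:

    # Hypothetical sketch of the FIG. 3 synchronization steps (72-82).
    def synchronize(first_player, second_player, get_url, start_timer_process):
        first_player.set_url(get_url("first"))             # steps 72 and 74
        second_player.set_url(get_url("second"))           # steps 76 and 78
        start_timer_process(first_player, second_player)   # step 80, see FIG. 4
        first_player.play()                                # step 82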

Referring now to FIG. 4, a timer process 90 of the present invention is depicted. The process includes determining 92 if a first player is playing streamed data. If it is not, the process proceeds to step 94, where a period of time (typically milliseconds) elapses before the determining 92 occurs again. If the player is playing the streamed data, the process signals 96 a second player to begin playing.
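The same loop, sketched in Python with hypothetical player methods:

    import time

    # Hypothetical sketch of the FIG. 4 timer: poll until the first player is
    # actually rendering streamed data, then start the second player.
    def timer_process(first_player, second_player, poll_interval_s=0.05):
        while not first_player.is_playing_streamed_data():  # step 92
            time.sleep(poll_interval_s)                     # step 94
        second_player.play()                                # step 96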

Referring now to FIG. 5, a transcoding/rendering process 100 of the present invention is depicted. The process includes retrieving a dataset (SIS Commands) for the unique picture to video (or multi-media production) identifier 102, converting the dataset to an XTL file 104, invoking a rendering engine and providing the XTL file, an output path, a file name, a file type, and an encoding profile 106, and storing the resulting output file 108.
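A sketch of this pipeline under assumed interfaces; the store, converter, and rendering engine names are hypothetical, and only the step sequence comes from the figure:

    # Hypothetical sketch of the FIG. 5 transcoding/rendering steps (102-108).
    def render_production(production_id, store, convert_to_xtl, rendering_engine):
        dataset = store.get_sis_commands(production_id)  # step 102
        xtl_file = convert_to_xtl(dataset)               # step 104
        output_file = rendering_engine.render(           # step 106
            xtl=xtl_file,
            output_path="/renders",
            file_name=f"{production_id}.wmv",
            file_type="WMV",
            encoding_profile="default",
        )
        store.save(output_file)                          # step 108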

Referring now to FIG. 6, a user interface for management of uploading images 200 into the system is depicted. The reference numerals in the figure include: 202, which is a UI Button that opens another interface (240) to select images from the user's computer to upload; 204, which is a UI Button that opens another interface to select images from a web site for upload into the system; 206, which is a UI Button that opens another, simpler interface (260) to allow the user to select images to upload; 208, which is the image "wallet" where all the uploaded images are displayed; 210, which is a UI Button that moves selected clips from the wallet to the storyboard (288); 212, which is a UI Button that moves all clips from the wallet to the storyboard (288); 214, which is a UI Button that closes the "wallet" window; 216, which is a UI Button that loads the "Workbench" UI (420); 218, which is a UI Button that loads the "Mixer" UI (280); 220, which is a UI Button that loads the "Upload Management" UI (200); 222, which is a UI Button that allows the user to manage their application preferences; 224, which is a UI Button that displays "Help Topics" in a text view; 226, which is a UI Button that displays a video tutorial about the process; and 228, which is an area for display of textual messages to the user.

Referring now to FIG. 7, a user interface for uploading images from a user's computer 240 into the system is depicted. The reference numerals in the figure include: 242, which is a file directory listing of the end user's computer; 244, which is a thumbnail display area of images on the end user's computer; 246, which is a thumbnail display area of images selected to be uploaded into the system; 248, which is a set of UI Buttons to manage selected images being displayed in 246; 250, which is a UI Button to start the upload process; 252, which is a UI Button to select how the images are displayed in 244; 254, which is a UI Button to select how the images are displayed in 246; and 256, which is a set of display graphics detailing the number and size of currently uploaded files.

Referring now to FIG. 8, a simpler user interface to upload images 260 into the system is depicted. The reference numerals in the figure include: 262, which is a UI Button to open an image selection dialogue; and 264, which is a UI Button to start the upload process.

Referring now to FIG. 9, a "mixer" user interface where an end user can assemble a video production 280 is depicted. The reference numerals in the figure include: 282, which is a Media Player to play the video production; 284, which is a thumbnail image that represents the original image's sequence in the video; 286, which is the "End Clip" marker (which designates the end of the video production); 288, which is the Storyboard where the end user interacts with the thumbnail images; 290, which is a UI button that opens the image "wallet" (208); 292, which is a UI button that opens the Video Transition Chooser Interface (310); 294, which is a text based help menu system; 296, which is a UI button that opens the voice recording interface (400); 298, which is a UI button that opens the female voice text-to-speech input interface; and 300, which is a UI button that opens the male voice text-to-speech input interface (390).

Referring now to FIG. 10, a video transition chooser interface 310 is depicted. The reference numerals in the figure include: 312, which is a listing of available video transitions to choose from; 314, which is a listing of available transition speeds to choose from; 316, which is a UI button to disable use of video transitions; 318, which is a UI button to close the current window without saving any choices made by the end user; and 320, which is a UI button to accept the currently selected transition type and speed.

Referring now to FIG. 11, a click or context menu that is displayed when a user clicks on a thumbnail image 330 is depicted. The reference numerals in the figure include: 332, which is a UI button that displays a preview of the image; 334, which is a UI button that creates a duplicate of the current image; 336, which is a UI button that opens the "Pan and Zoom" interface window (350); 338, which is a UI button that opens the "Crop Image" interface window (370); 340, which is a set of UI buttons that set the duration of the current image; and 342, which is a set of UI buttons to remove special effects that have been added to the image.

Referring now to FIG. 12, a "Pan and Zoom" interface window 350 is depicted. The end user makes a selection for the beginning viewport (352) and the end viewport (354). These selections will create simulated motion in the end video by morphing the image from the beginning viewport to the end viewport. The reference numerals in the figure include: 356, which is the UI button that saves the changes made to the "Pan and Zoom" options; and 358, which is the UI button to cancel any changes made and close the current window.

Referring now to FIG. 13, a "Crop Image" interface window 370 is depicted, which allows a user to select a portion of the original image using the selection box (372) and save the new image. The reference numerals in the figure include: 374, which is a UI button to close the current interface window; 376, which is a UI button to save the current "cropped" image as a new image; 378, which is a UI button to zoom into the current viewport; and 380, which is a UI button to preview the cropped image.

Referring now to FIG. 14, a male voice text-to-speech input interface 390 is depicted. This interface allows the end user to type text into the window (392) and have that text processed into audio by pressing the process speech UI button (394).

Referring now to FIG. 15, a voice recording interface window 400 is depicted, which allows the user to type a script into the window (406) to be recited during a recording. The reference numerals in the figure include: 402, which is a display area for the telephone number to call to make the voice recording; 404, which is a display area for the code number that is inputted into the voice recording system; and 408, which is a UI button to preview the recorded audio.

Referring now to FIG. 16, a "Workbench" user interface window 420 is depicted. The reference numerals in the figure include: 422, which is a display area for the title and description of the video; 424, which is a display area for textual information and help; 426, which is a UI button that triggers a download of the final video to the end user; 428, which is a UI button that allows the end user to send the video link via e-mail; 430, which is a UI button that allows the end user to view the video link; and 432, which is the UI button that allows the end user to download HTML code to display the video from within an HTML page.

In one embodiment of the present invention, a method for creating a video clip comprises receiving still image files by a first module, creating a copy of a still image from the still image files with reduced dimensions (such as a thumbnail) by the first module, creating a new still image from a selected area of the still image (i.e., cropping), storing the new still image and the copy of the still image with a unified file name (UFN), ordering the stored images, creating an audio file by at least one of a second module and a third module, creating a timeline, by the first module, related to the ordered images and the created audio, and rendering the timeline into the video clip by a fourth module.

The method further comprises storing at least one of: metadata related to the still image and the copy of the still image, metadata related to the audio file, the timeline, and information comprising the creating of the audio file (for example, if a user uses TTS, then the information that is stored is the text used to generate the audio; if the user uses a phone number, then the information that is stored is the dialed number, the calling number, and a record number or a code number of the IVR application); transitioning from one to another one of the ordered images, and further comprising storing a transition identifier and a transition length; and retrieving, by the first module, at least one of: the metadata related to the audio file, the metadata related to the still image and the copy of the still image, the timeline, and the information comprising the creating of the audio file, wherein the metadata is stored in a command (such as a SIS Command), and wherein the metadata stores at least one of: a height, a width, a size, compression data, and a file format.

The method also comprises panning and zooming at least one of: a portion of or the copy of the still image, a portion of or the new created still image, a portion of or the stored new still image, and a portion of or the stored copy of the still image, wherein the panning and the zooming is a set of coordinates related to the image, wherein the copy of the still image is used for display purposes (i.e., a thumbnail), wherein the new created still image is displayed as an image with reduced dimensions, and wherein the set of coordinates is stored in the timeline. The method further comprises optionally previewing the timeline via a fifth module, wherein the timeline includes at least one of: a length of the audio file, a length of the video clip, the order of the images, a display time of each of the images, transition identifiers, transition lengths, panning and zooming coordinates, an identifier of the audio file created by the second module or the third module, and a volume envelope (which includes "volume points" that are the volume and the time at which the volume occurs) associated with the created audio file.
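For illustration only, a timeline carrying the fields listed above might look like the following; the key names are hypothetical and are not the patent's actual SIS Command syntax:

    # Hypothetical shape of the timeline described above.
    timeline = {
        "audio_file_length_s": 45.0,
        "video_clip_length_s": 45.0,
        "ordered_images": ["img_001.jpg", "img_002.jpg"],
        "display_times_s": [4.5, 4.5],
        "transitions": [{"id": "crossfade", "length_s": 1.0}],
        "pan_zoom": [{"image": "img_001.jpg",
                      "start": (0, 0, 640, 480),     # beginning viewport
                      "end": (100, 80, 740, 560)}],  # ending viewport
        "audio_file_id": "ivr-0001",
        # Volume envelope as (time_s, volume) "volume points".
        "volume_envelope": [(0.0, 0.8), (40.0, 0.8), (45.0, 0.0)],
    }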

In another embodiment of the present invention, a system for creating a video clip comprises a web server adapted to receive one or more images, the web server adapted to create a copy of the images, the web server adapted to create one or more new images from one or more selected areas of the images, and the web server adapted to provide a unified file name to each of the new images and a unified file name to each of the copies of the images, wherein the copy of the still images is created with reduced dimensions.

The system further comprises memory adapted to store at least one of: the new image, the copy of the image, the new image provided with the unified file name, and the copy of the image provided with the unified file name, wherein the memory is coupled to the web server; a device coupled to the web server, wherein the device is adapted to order at least one of: the received images, the copy of the images, the new images, the stored images, and the ordered images; a text-to-speech module and an integrated voice response module, wherein at least one of the modules is adapted to create an audio file, wherein at least one of the modules is coupled to at least one of the web server and the memory, wherein the web server is adapted to create a timeline related to the ordered images and the created audio file, and wherein the timeline and the audio file are stored in the memory; and a rendering server adapted to render the timeline into the video clip.

In a further embodiment of the present invention, a computer readable medium comprises instructions for: creating a copy of an image with reduced dimensions, creating a new image from a selected area of the copied image, storing the new image and the copy of the image with a unified file name, and creating a timeline related to the image, the new image, and to audio related to the image and to the new image, wherein the timeline is used to create a video clip.

ASSEMBLING AND DISTRIBUTING MULTI-MEDIA OUTPUT

Referring now to FIG. 17, reference numeral 10′ describes the overall flow of multimedia from the user to a final multimedia output such as a multimedia presentation. Reference numeral 11′ is the internal processing of the multimedia data and user interaction. Reference numeral 12′ depicts a storage system on which the multimedia is stored. Reference numeral 14′ is the rendering server. This is hardware and/or software that takes many media files as input and outputs a single file. Reference numeral 16′ describes the voiceover system, which is hardware and/or software that allows a telephone to record audio that is saved into the storage system 12′. Reference numeral 18′ is the uploader. It is a software program that is run on the user's machine. This allows the user to select the media that is desired to be placed into the system. It then encodes the media into the proper format and allows the user to categorize each media file.

Reference numeral 20′ is the user's interface into the system, which is preferably web-based using the web server and a scripting language. Reference numeral 22′ are raw multimedia files that are chosen by the user to be saved into the system. Reference numeral 24′ is the end user's hardware that receives the multimedia presentation. Reference numeral 26′ is the end user's storage system. This could be CD Rom, DVD and MP3 player hardware, for example. Reference numeral 28′ is the flow of raw multimedia files into the uploader system. Reference numeral 30′ are the encoded multimedia files that the uploader sends to the storage system 12′. Reference numeral 32′ is data sent by the uploader 18′ into the storage system 12′. Reference numeral 34′ is an end user query or search that is used to populate the user interface. Reference numeral 36′ is the flow of data from the storage system 12′ to the user interface 20′. Reference numeral 38′ are the audio files that the voiceover system 16′ sends into the user interface 20′, which are then sent into the storage system 12′ via message 36′. Reference numeral 40′ is the decision of a destination based on how the information is sent to the user. Message 42′ is a set of commands sent to the rendering server 14′. Reference numeral 44′ is the output from 40′ that is sent to an end user via email, the web, MMS, SMS or other text messaging options. Reference numeral 46′ are the multimedia files taken from the storage system 12′ into the rendering server 14′. The rendering server 14′ takes many of these data files 46′ and creates a single file, which it sends to the storage system 12′ via message 48a′. It also has the ability to send the file, via messages 48b′ and 48c′, to the end user 24′ and/or directly to a CD Rom 26′.

FIG. 17 is now further described. The uploader (1) converts and encodes any content (video clips, audio files, animation, graphics, HTML, text, etc.). The content can be retrieved from disk or captured directly from a camcorder, web cam, digital camera, camera equipped cell phone, microphone or other similar device. The uploader also creates a Unified File Name (UFN). The UFN components and a text/XML file (2) with the same file name provide for index and keyword searching. The content is then sent to the right project stored on a stand alone PC, a local file server, or via the Internet to a data center via an FTP site in XML format (3). The UFN prevents the need for a proprietary database and allows users to collaborate across different organizations. Using the user interface 20′, or M-Plat, the users organize content, edit the media, create movies, add voice over via any telephone, and create digital presentations (5). The content is organized in projects or retrieved in real time using indexes built into the UFN or a keyword search (4).

Using an IVR (telephony) system of the present invention (6), the user can add a voice over using any ordinary telephone. The M-Plat also controls publishing, distribution, reporting and archiving. If distributed via e-mail or web site, the instruction sets (SISCommands) are stored on-line and e-mail notifications are sent (7). If a new file is required, the SISCommand is sent to the M-Gen (8) and a new file is rendered. The new file is stored in the project and is sent to mobile users (9) via a cellular network or to a destination device via an FTP site (10).

FIG. 17 further depicts a system for assembling and distributing multi-media output, comprising: a rendering server; a web server; and storage, wherein the servers and the storage are operably coupled; the storage adapted to receive digital media and properties of the media, store the media and the properties, and transmit the media and the properties; the web server adapted to perform at least one of a following action: retrieve the media and properties of the media; manipulate the media and the properties; assemble the properties; and transmit at least one of a following element from a group consisting of: the properties; and the assembled properties; and the rendering server adapted to receive commands from the web server. The commands include at least one of a following element from a group consisting of: the properties; and the assembled properties; and based on the commands, the rendering server performs at least one of a following action: retrieve the media based on the commands; render the retrieved media; store the retrieved media on the storage; and transmit the retrieved media to a destination. The system further comprises an audio capture module operably coupled to the web server, the audio capture module adapted to capture audio and DTMF tones, encode the captured audio, and transmit the encoded audio and information related to a call involved with generating the DTMF tones. The digital media comprises at least one of a following type of media from a group consisting of: video; audio; still images; file attachments; animation; and HTML. The manipulation of the media comprises at least one of a following action: copy the media; delete the media; and rename the media. The manipulation of the properties is adapted to change a value of the properties. The assembly of the properties is adapted to sequence the properties associated with each of the media. The transmission of the properties is adapted to transmit at least one of a following element from a group consisting of: the sequence; the properties; and the media. The commands further include at least one of a following element from a group consisting of: a destination; and a type of the media.

Referring now to FIG. 18, reference numeral 50′ is an overall process from the receipt of commands 44′ to the receipt of the multimedia presentation 56′. The process starts with the receipt of SIS commands 44′ by the end user 24′. Reference numeral 52′ is the user's request to view the multimedia presentation. This is sent to the storage system 12′. Reference numeral 54′ is a decision based on what type of hardware 24′ the user originates from: either a computer or a mobile device. If it is a computer, then 56′ shows the streaming of the media, the multimedia presentation, to the end user's computer. If it is a mobile device, then a decision 58′ has to be made on what type of device it is and how to send the multimedia to it. If the decision is that the mobile device can handle streamed multimedia 56′, the media is sent to the device 24′. If the mobile device cannot handle streamed media, then a set of commands 42′ is sent to the rendering server 14′ to create a single file, which is then sent via 60′ to the mobile device 24′.
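The decision logic can be sketched as follows, with hypothetical device attributes and helper functions passed in so the fragment stays self-contained:

    # Hypothetical sketch of the delivery decision (54', 58') described above.
    def deliver(presentation, device, rendering_server, stream, send_file):
        # Computers, and mobile devices that can buffer, receive streamed media.
        if device.kind == "computer" or device.supports_streaming:
            return stream(presentation, device)                        # 56'
        # Otherwise render the SIS commands into a single downloadable file.
        rendered = rendering_server.render(presentation.sis_commands)  # 42', 14'
        return send_file(rendered, device)                             # 60'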

FIG. 18 is further described below.

The SISCommands are sent to recipients via the Internet (1) or wireless networks (2). Upon request, the content is played back using streaming technology (3). In the wireless environment, the network carrier determines the right format for streaming or download (4). The receiving party may choose to respond to or forward the message, and can even add a Voice Over using any telephone (5). The reply/forward message is stored in the project and notification is sent to the receiver (6). Upon request for play back, the content is sent to Internet users using Streaming Technology (7). Mobile users, upon determination of the right player, receive streamed media (8) or a new, downloadable file via the M-Gen (9). Since the SISCommands are small (1-5 K) compared to any typical Rich Media file (1-100 MB), storage space and airtime are largely reduced.

Referring now to FIG. 19, the uploader 18′ is depicted. The process flow from login to the system until the media is sent to the storage system 12′ is described. At reference numeral 70′, the user logs in with a set of credentials. At reference numeral 72′, a check is made to see if the credentials are valid. If they are not valid, the application exits 74′. If a current library is found with the user's credentials 73′, then a request is made to get the project information 84′. If a project is not found, one path 76′ allows the user to request a new project: reference numeral 78′ allows a choice of the look and feel, the background, and the color scheme, and then 80′ sends a notification to have the project built. In a simpler version, a new project 82′ is created automatically. Once a project exists for the logged-in user 84′, project information is then requested from the server 12′. At reference numeral 86′, if previous multimedia has already been encoded and is ready to send, it goes directly to 104′ and is uploaded into the system. At reference numeral 88′, the user can choose to work in an offline mode, in which the data is not sent to the server after it is finished encoding but waits until a later time. Reference numeral 90′ is a decision whether to capture multimedia data directly from the computer or to select files. If the user wishes to capture live data directly from the computer, then a capture device 92′ is used. After the files have either been captured or selected 94′, a JPEG image is extracted from the file to allow the media to be represented by a graphic icon or thumbnail. This happens automatically. Reference numeral 96′ is a check for advanced options. If the user does not have advanced options, then they are given an opportunity to select a custom graphic or JPEG to be used as a thumbnail 98′. At reference numeral 100′, if they do have advanced options, then they are allowed to select a graphic representation or thumbnail, to categorize the file using the UFN (unified file name), and to type in a description of this media file. At reference numeral 102′, the media is then encoded into the proper streaming format and, at reference numeral 104′, it is uploaded into the storage system 12′. At reference numeral 106′, the data on the user's machine is then deleted and, at reference numeral 108′, the user receives a "done" message that the process has been completed.

Referring now to FIG. 36 a, a login screen of the uploader is depicted. At reference numeral 70′, the login tab 112′ and the user input 114′ for the library ID 206′ are shown. Also shown are the user input 116′ for the project ID 208′, the input for the user's user name 118′ into the system 202′, the user's input 120′ for the password, the button the user clicks to verify their login information 122′, and a checkbox 124′ with which the user can indicate whether they are in online or offline mode, as shown in FIG. 35 (reference numeral 88′).

Reference numeral 128′ is a button to clear the cache. This removes any extraneous files on the user's desktop and is the same process as shown in FIG. 35 (reference numeral 106′).

Referring now to FIG. 36 b, the main screen of the application 90′ (which allows users to select digital media from their local computer) includes embedded help videos 126′. The following controls are used by the application: video 132′, screen shots 134′, file attachments 136′, static pictures or graphics 138′, audio files 140′, look and feel 142′, HTML grid 144′, beginning the encoding and uploading 146′, a descriptive help text 148′, an area where the selected files are displayed 150′, a button that allows users to add files 152′, clearing any files 154′ in the list 150′, removing only files that are checked 156′ in the list 150′, another textual help box 158′, a control allowing the user to choose different formats and profiles for their encoding sessions 160′, a control allowing the users to view online help 162′, a control allowing the users to capture a custom thumbnail from an image in the media 164′, and a “next” button 166′ which takes the users to a next step or area.

Referring now to FIG. 36 c, the thumbnail extractor 94′ is depicted.

Referring now to FIG. 36 d, the encoding and uploading screens 102′ and 104′ are depicted.

Referring now to FIG. 37, the process 170′ by which the user categorizes their media by choosing its UFN (unified file name) is depicted. Reference numeral 172′ is the unique contact ID of the person uploading or storing the file, reference numeral 174′ is a general category that describes the content of the file, reference numeral 176′ is another category or subcategory describing the file, reference numeral 178′ is the creator of the file, reference numeral 180′ is the date the file was created, reference numeral 182′ is the version or sequence number, and reference numeral 184′ can be any user defined codes. When all these different categories and codes are put together, the result is a UFN that is unique to this file.

The UFN is created by grouping together a set of codes, IDs, and dates. The actual code naming can be done by the end-user or automatically following sets of rules (for example, a predefined set of rules). The main advantages of the UFN are that it is virtually impossible for any user to create a duplicate file name, and a query and retrieval of specific data/raw material can be done directly by the operating system. There is no need for an agreed-upon database in order to share data among users and across organizations.

Referring now to FIG. 38, the user interface to choose the UFN for a media file is disclosed. The screen shot of the user interface 170′ includes the general category 174′, the class or subcategory 176′, the creator 178′, the creation date 180′, the sequence number 182′, the thumbnail or graphic representation of this digital media 184′, the textual description 186′ that can be entered by the user, the section 188′ in which this file will be placed in the storage system 12′, a displayed UFN 190′, a checkbox 192′ that the user selects when done choosing all the categories, and a button 194′ that is used to process or import into the system any checked files.

The “copy down” button 196′ allows the user to copy fields 174′, 176′, 178′, 180′, 186′, and 188′ to the fields below it. Reference numeral 198′ depicts a set of fields that have been chosen, reference numeral 199′ shows that, in the UFN, files are automatically identified by the type of file (which is determined from FIG. 36 b, 132′ through 144′), reference numeral 200′ is the media player in the user interface that is used both to display help clips and to display the media as it is playing, reference numeral 202′ is the user's login or user name, reference numeral 204′ is their access level, reference numeral 206′ is the unique library ID that they are currently in, and reference numeral 208′ is the unique project they are currently in.

Referring now to FIGS. 39 a and 39 b, an alternate user interface to FIG. 38 is depicted.

Users are capable of defining the source, purpose, type, creator, and date created while dumping the raw material (video, audio, or pictures) or creating the files (text and images). The following steps are taken:

1. The user name and capture date are collected, and a unique ID is assigned to each user and/or production.

2. Text describing the theme and the flow is added.

3. Category and classes are added from a pre-defined, self-learning database.

4. An abbreviation of the creator name and the creation date are added to create a unique identifier.

5. The file type is automatically collected and a code is added to the file name.

6. A unique file name with a time stamp is generated and the data is stored.

Regardless of the source, the file type, and the user, a file stored using the SISController allows for storage and retrieval of all types of data and digital video, pictures, and audio, allowing different users to collaborate. The unique file naming prevents duplications, and files can be retrieved either by using a proprietary database with full text search capability, by searching on a defined field, or directly by any operating system.

The present invention further describes a method for creating a unified file name that comprises: assigning a unique identifier based on a destination of a file; assigning a code based on a type of the file after the unique identifier; assigning a code based on a user defined category after the code based on the file type; assigning a code based on a user defined sub-category after the code based on the user defined category; assigning a code related to at least one of: a creator of the file; and a creator of a content of the file, after the code based on the user defined sub-category; and assigning a creation date of at least one of: the creator of the file; and the creator of the content of the file, after the previously assigned code. The method further optionally comprises assigning a version of the file after the creation date and, optionally, at least one user defined code after the assigned version.
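
By way of illustration, a unified file name following the ordering described above might be assembled as in the sketch below. The delimiter, field codes, and function name are assumptions made for illustration; in practice the code naming is done by the end-user or automatically following predefined rules.

    from datetime import date

    # Illustrative UFN assembly in the order described above: destination
    # identifier, file-type code, category, sub-category, creator
    # abbreviation, creation date, optional version, optional user codes.
    def build_ufn(destination_id: str, file_type: str, category: str,
                  subcategory: str, creator: str, created: date,
                  version: int | None = None,
                  user_codes: list[str] | None = None) -> str:
        parts = [destination_id, file_type, category, subcategory,
                 creator, created.strftime("%Y%m%d")]
        if version is not None:
            parts.append(f"v{version:02d}")
        parts.extend(user_codes or [])
        return "-".join(parts)

    print(build_ufn("C1024", "VID", "REAL", "TOUR", "JDOE",
                    date(2004, 12, 2), version=1, user_codes=["X7"]))
    # -> C1024-VID-REAL-TOUR-JDOE-20041202-v01-X7

Because the creator abbreviation and creation date are embedded in the name itself, collisions are improbable and the name can be queried directly by the operating system without a shared database.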

Referring now to FIG. 24, reference numeral 220′ shows three process flows for the exchanging of digital media between two entities. Reference numeral 221′ is the path for publishing, in which files are selected to be published or authored 222′, terms for the purchase or reuse of the media are specified 224′, and the media is published along with its terms 226′. Reference numeral 227′ is the process of purchasing digital media, in which the purchaser makes a response 228′ to any published media from 226′. The buyer or purchaser can modify the offer 230′ or make a bid 232′ on the digital media that they wish to purchase. Reference numeral 234′ is the acceptance by the publisher of the offer or bid for the digital media. Reference numeral 235′ is the process flow of the digital media after an agreement has been reached on its purchase, in which the original file from 222′ is copied into the owner's project 236′, a jpeg image or thumbnail is created in the purchaser or buyer's project 238′, and a tracker or reporting system is activated for this piece of digital media 240′.

Referring now to FIG. 25, reference numeral 221′ is the process that the owner of digital media uses to publish the media along with its terms. At reference numeral 222′, the owner can view a list of current catalogs or stores that contain current digital media. The owner then has a choice to edit or delete current catalogs 225′ or create a new catalog of content 223′. At reference numeral 228′, the owner selects clips to be published, creates a description for the catalog or store, and then saves it. At reference numeral 224′, the owner of the digital media sets forth the terms for the purchase of their media and selects the look and feel, banner, backgrounds, color schemes, etc., and at reference numeral 226′ the list of media is then published on an electronic storefront.

Referring now to FIG. 26 a, reference numeral 221′ is the user interface for publishing or exporting digital media for purchase.

Referring now to FIG. 26 b, the user interface to specify terms 224′ is depicted. Reference numeral 500′ is the field code that becomes part of the UFN, reference numeral 502′ is a short textual description, reference numeral 504′ is the business description that is used in the exchange agreement, and reference numeral 506′ is the catalog description which is used when the site is published in 226′.

Referring now to FIG. 27, the process 34′ by which a user creates a query or search into the storage system 12′ to populate the user interface 20′ is depicted. At reference numeral 241′, the user starts the process. The user can select from a plurality (1-6, for example) of different search variables. Reference numeral 242′ is the client ID to search on, reference numeral 244′ identifies categories to search, reference numeral 246′ identifies subcategories to search, reference numeral 248′ identifies creators to search, reference numeral 250′ can be a range of creation dates to search, and reference numeral 252′ identifies any key words contained in the description of the file to search on. Reference numeral 254′ ends the user selections; at reference numeral 256′, the query is sent to the database; and at reference numeral 258′, data is returned to populate the user interface.

Retrieving any data and populating the user interface 20′ is done by a simple query, following the same coding system that created the file name, or by keyword search. This presents the data as a set of images. The human brain can process images far faster than text. Efficiency and productivity increase, and there is no need for user training.
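
By way of illustration, because each UFN carries its searchable fields at fixed positions, a query over the search variables of FIG. 27 can reduce to simple string matching on file names, as in the sketch below. The field positions follow the illustrative build_ufn example above; keyword search over descriptions 252′ would be handled separately.

    # Illustrative query over UFN-named files using the FIG. 27 search
    # variables: client ID 242', category 244', subcategory 246',
    # creator 248', and a creation date range 250'.
    def matches(ufn: str, client_id=None, category=None, subcategory=None,
                creator=None, date_range=None) -> bool:
        fields = ufn.split("-")
        checks = {0: client_id, 2: category, 3: subcategory, 4: creator}
        if any(v is not None and fields[i] != v for i, v in checks.items()):
            return False
        if date_range is not None:
            start, end = date_range
            if not (start <= fields[5] <= end):  # YYYYMMDD strings sort correctly
                return False
        return True

    files = ["C1024-VID-REAL-TOUR-JDOE-20041202-v01-X7",
             "C2048-PIC-AUTO-SALE-ASMIT-20040115-v03"]
    print([f for f in files if matches(f, category="REAL")])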

Referring now to FIG. 28, the overall process flow 259′ that the user follows to create their multimedia presentation is depicted. Reference numeral 260′ begins the process. At reference numeral 262′, media files are selected and, optionally, start and end times within the media files are selected; at reference numeral 264′, text, audio tracks, or special effects are selected. At reference numeral 266′, a decision is made as to whether the user wishes to add more digital media. If yes, they can select a transition 268′ between media that takes the user back to 262′. If they do not wish to add any more media, they select their output type 270′. If the output type is rich media 272′, they select a destination. At reference numeral 274′, the job can be submitted to the rendering server; at reference numeral 278′, the instructions are sent to the chosen destination. After the process is complete, the user is prompted to create another multimedia presentation 280′. If they select no 282′, the process ends. If they select yes, 260′ starts the process over again. At reference numeral 276′, if a user chooses to output SISCommands only, then they select a destination 278′ and the commands are sent to that destination.

The user identifies (visually or by text) the desired clip (1) and can play or run the associated application in the user interface 20′ display window. Text and media can be selected from pre-defined menus (2). (The menus are defined by the system administrator/service provider.) Transition types are selected as well (3). If a rich media output is selected (4), then the SISCommands 42′ are sent directly to the Rendering Server (5) for production; otherwise the SISCommands are sent to another end-user, portable device, service provider, or storage (6). The process is fast and requires no training. A novice user can produce a rich media presentation in minutes, a task that otherwise requires a studio and many hours of labor by highly trained professionals.
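
By way of illustration, the selections gathered in FIG. 28 might populate a timeline structure such as the sketch below. The field names are assumptions made for illustration, although the contents mirror the timeline elements recited elsewhere in this specification: the ordered images, a display time for each, transition identifiers and lengths, and the length of the audio file.

    from dataclasses import dataclass, field

    # Illustrative timeline assembled from the FIG. 28 selections.
    @dataclass
    class Clip:
        media_id: str
        display_time_s: float          # how long this image or clip is shown

    @dataclass
    class Transition:
        transition_id: str             # e.g. "fade", "wipe"
        length_s: float

    @dataclass
    class Timeline:
        clips: list[Clip] = field(default_factory=list)
        transitions: list[Transition] = field(default_factory=list)
        audio_id: str | None = None
        audio_length_s: float = 0.0

        @property
        def video_length_s(self) -> float:
            return (sum(c.display_time_s for c in self.clips)
                    + sum(t.length_s for t in self.transitions))

    t = Timeline(clips=[Clip("UFN-A1B2", 4.0), Clip("UFN-C3D4", 5.0)],
                 transitions=[Transition("fade", 1.0)],
                 audio_id="UFN-VOX9", audio_length_s=10.0)
    print(t.video_length_s)  # 10.0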

Referring now to FIG. 29 a, the user interface 20′ is depicted. Once the user has selected their clips, they select different templates in which the multimedia presentation resides 284′.

Referring now to FIG. 29 b, the user interface 290′ for various options for the multimedia presentation is depicted.

Referring now to FIG. 29 c, the destination selection 276′ is depicted.

Referring now to FIG. 30 a, the user interface 262′ for selecting start and end times within a clip is depicted.

Referring now to FIG. 30 b, the user interface 263′ for sequencing clips, and for selecting text, audio tracks, special effects, and transitions 264′-268′, is depicted.

Referring now to FIGS. 31 a-31 b, an alternate process controlled by user interface 20′ is depicted. The process is described below. Upon starting a new project, raw video is collected from all sources, including the user's PC, dedicated servers, other stations on the network, and via the Internet, for example. Time stamps are captured for the “START” and “STOP” of individual clips, and audio from different sources (such as music, voice, and sound effects) is selected. The user provides text information that is played as a banner (at the bottom of the movie, for example) or as a stand-alone picture. Delivery information is selected, which includes: physical media (CDR) or sending rendering instructions to an end user via e-mail.

Referring now to FIG. 32, the voiceover process flow 16′ for capturing audio over a telephone, encoding it into a streaming format, and saving it onto a storage system 12′ is depicted. Reference numeral 300′ is an incoming call where the caller ID is captured. At reference numeral 302′, the number that was dialed is detected and, depending on which number the user dialed, different greeting sets are encountered. Reference numeral 304′ is the standard greeting set, and reference numeral 306′ is a custom greeting set with help and samples. At reference numeral 310′, the user's input is captured to direct them to the help system 312′, the sample system 314′, or to the prompt for voice recording 318′.

Reference numeral 308′ is another custom greeting set that includes a subset of greetings 316′. At reference numeral 318′, the user is prompted to record their message. At reference numeral 320′, a decision is made regarding DTMF enabling. At reference numeral 322′, the DTMF capturing is set to on; at reference numeral 324′, the audio is recorded; and at reference numeral 326′, if advanced options are enabled, the user is allowed to play back the recording 328′ or re-record their message 330′. At reference numeral 332′, after recording is finished, the voice file is then stored on the storage system 12′; at reference numeral 334′, the caller's number is also stored on the storage system; at reference numeral 336′, if DTMF tones were captured, they are also stored on the storage system; and at reference numeral 38′ (which refers back to FIG. 33), data flows from the voiceover system through the user interface into the storage system 12′.

FIG. 32 is further described below. Upon calling in, the caller ID is captured (1) and, based on the call-in phone number, a greeting to play is selected (2). The system administrator can set any number of voice boxes and greeting paths. In this example, set 1 goes directly to the voice prompt, set 2 provides for help and pre-recorded samples, and set 3 is a combination of several voice boxes. The user is then prompted to record his voice upon a tone signal (3). If the DTMF option (4) is enabled, then DTMF tones and times are captured. If the playback and re-record option is enabled, then the user is prompted (5). The voice file is encoded and stored on the server (6), as well as the caller ID (7) and DTMF time code (8).
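
By way of illustration, the greeting-selection and recording flow of FIG. 32 might be sketched as follows. The telephony primitives are abstracted away, and the captured data (caller ID, optional DTMF tones and time codes, and the encoded voice file) is returned as a record destined for the storage system 12′; all names are assumptions made for illustration.

    # Illustrative FIG. 32 call flow. Greeting sets 1-3 correspond to
    # the dialed-number routing described above.
    GREETINGS = {1: "voice-prompt.wav",        # set 1: straight to prompt
                 2: "help-and-samples.wav",    # set 2: help and samples
                 3: "combined-boxes.wav"}      # set 3: several voice boxes

    def handle_call(caller_id: str, dialed_set: int,
                    dtmf_enabled: bool = True) -> dict:
        greeting = GREETINGS.get(dialed_set, GREETINGS[1])  # (2)
        record = {"caller_id": caller_id,                   # (1)(7)
                  "greeting": greeting,
                  "voice_file": f"{caller_id}-msg.wma"}     # (3)(6) encoded audio
        if dtmf_enabled:
            record["dtmf"] = []   # (4)(8) captured (tone, time code) pairs
        return record             # stored on the storage system 12'

    print(handle_call("5551234567", dialed_set=2))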

Referring now to FIG. 33 a, the user interface 340′ to the voiceover system console is depicted. Reference numeral 341′ is a button that starts or stops the software, reference numeral 342′ is the system status display page, reference numeral 344′ is the host settings, reference numeral 346′ is the media and coding settings, and reference numeral 348′ is the voice recording settings. Reference numeral 350′ shows the current status of the phone lines, reference numeral 352′ displays the name of the group that each phone line is associated with, reference numeral 354′ displays the current time spent in each step of process 16′, reference numeral 356′ shows the total accumulated duration of the current call, and reference numeral 358′ shows the current application designated for each phone call.

Referring now to FIG. 33 b, the user interface 346′ for the media and coder settings is depicted. Reference numeral 360′ includes user inputs for the settings, and reference numeral 362′ is the testing and debugging interface.

Referring now to FIG. 34 a, a sample multimedia presentation 370′ is depicted. Reference numeral 372′ is a timeline to allow the user to sequence a subset of clips to play their own personal movie. Reference numeral 374′ is the interface for a user to record their own personal voice message. Reference numeral 376′ allows the end user to send a copy of their personalized presentation via email.

Referring now to FIG. 34 b, an alternate sample of the end user's experience from a multimedia presentation 371′ is depicted.

FIG. 35 a includes Instruction Set A: play the starting animation and stay in a loop until movie 1 is playing, for a defined minimum time if applicable (buffering done). At Instruction Set B: as soon as movie 1 ends, play the transition and stay in a loop while the movie is buffering, for a defined minimum time if applicable. At Instruction Set C: upon Event A, activate Special Effect 1 and, upon Event B, activate Special Effect 2.

FIG. 35 a is further described below:

1. The page is loaded and the Starting Animation plays.

2. At the predefined time, Movie 1 is called to play. Buffering starts while the Starting Animation stays in a loop.

3. As soon as Movie 1 starts, Starting Animation stops.

4. Movie 1 plays. Once finished, the Transition Animation plays.

5. At the predefined time, Movie 2 is called to play. Buffering starts while the Transition Animation stays in a loop.

6. Movie 2 plays until the end.

7. Upon Event A (mouse click, DTMF signal, key pad pressed, etc.), Special Effect 1 plays.

8. Upon Event B (mouse click, DTMF signal, key pad pressed, etc.), Special Effect Animation 2 plays.

9. Animation ends.

Referring again to FIG. 35 a, 379′ is the process flow to trigger animation events based on instructions sent while streaming media is playing, 380′ is the animation file, 382′ is the instruction set file, 56′ is the streaming file, 384′ is a starting animation as it is played after the page is loaded, 386′ refers to the event of the page load or the page starting while the animation is playing, 388′ shows the start of the streaming file's buffering event, 394′ shows the effect of that event, which stops the beginning animation, 396′ is the event that the streaming file has finished, which triggers 398′, the start of the transition 400′ in the animation file, 402′ is the start of the next streaming file, 404′ is its buffering event, 406′ is the event that the buffering is completed, 408′ is the instruction to start the transition from the animation file, 410′ is the event triggered from the instruction set 382′ which plays special effect animation 412′, 414′ shows another triggered event from the instruction set that plays a separate animation special effect 416′, 418′ is the event that the streaming file has ended, and 420′ sends a command to the animation file to play the ending animation 422′ (FIG. 35 b).
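
By way of illustration, the synchronization described above amounts to an event loop in which player events (page load, buffering, start, end, user input) trigger commands against the animation file 380′ through the instruction set file 382′. The sketch below shows that dispatch pattern; the event and command names are assumptions made for illustration.

    # Illustrative event dispatch for the FIG. 35 a instruction sets.
    INSTRUCTION_SET = {
        "page_loaded":      "play_starting_animation",  # 384'/386'
        "movie1_buffering": "loop_starting_animation",  # 388'
        "movie1_started":   "stop_starting_animation",  # 394'
        "movie1_ended":     "play_transition",          # 396'/398'/400'
        "movie2_buffering": "loop_transition",          # 404'
        "movie2_started":   "stop_transition",          # 406'/408'
        "event_a":          "play_special_effect_1",    # 410'/412'
        "event_b":          "play_special_effect_2",    # 414'/416'
        "movie2_ended":     "play_ending_animation",    # 418'/420'/422'
    }

    def dispatch(event: str) -> None:
        command = INSTRUCTION_SET.get(event)
        if command:
            print(f"animation <- {command}")  # stand-in for the animation layer

    for e in ["page_loaded", "movie1_buffering", "movie1_started",
              "movie1_ended", "movie2_buffering", "movie2_started",
              "event_a", "movie2_ended"]:
        dispatch(e)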

Referring now to FIG. 36, the following steps occur:

1. Content is uploaded to the server from existing data files, a video camera, or a cam-equipped cell phone.

2. Using the web-based M-Plat, any user can customize the content, create a personalized movie, add a voice over, and send it to any PC, web site, or cell phone.

3. The web server holds the content and customized messages. The stream server distributes the content.

4. If the message is to be sent to a cell phone, the M-Gen receives the time stamps and creates a new file using DES and the media encoders.

5. The M-Gen transfers the appropriate file and destination information to the carrier. Content is forwarded to the devices (download or streaming).

6. Mobile users can respond to the message by SMS and/or voice. The voice message is embedded in the response e-mail.

Referring further to FIG. 36 a, a process flow 429′ for receiving, and replying to, a multimedia message received by a mobile device user is depicted. This is an abbreviated process flow from FIG. 33 and FIG. 34 and includes raw media 22′ that is stored on storage devices 12′. A multimedia presentation is created at the user interface 20′ and is then sent 42′ to the rendering server 14′, which creates a single file and sends it 60′ to the mobile device. The mobile device 24′ receives it 432′, the mobile device user has a chance to reply to this presentation 25′, and the reply is sent via 434′ back to the storage system.

Referring now to FIG. 37, 430′ is the information flow from the user through the rendering server, with a single file being created, together with some samples of the information.

At 42′, the user submits instructions to the rendering server; at 438′, the rendering server retrieves one or more files and creates a single file; at 440′, that single file is sent to the user; and at 442′, the process ends. Reference numeral 444′ is an example of possible commands created by the user that the rendering server uses to assemble files into a new single file. The box labeled 262′-268′, 276′, 290′ is an example of the data that is captured from the user to create their multimedia presentation.

Referring now to FIG. 38, 14″ is the process flow of the rendering server 14′ from the rendering server's perspective. At 42′, one or more sets of instructions have been sent to the rendering server; at 46′, the rendering server accesses raw files from the storage system 12′; at 456′, the rendering server runs a process that combines one or more files into a new single file; and 48A′-48C′ is the return of this single file of media from the rendering server back to the user.

Referring now to FIG. 39, 14′″ is an expanded view of the rendering server process. It begins at 42′, where a set of commands is received; 42″ is the list of possible fields included in this command set. Reference numeral 470′ refers to a job control process, which is responsible for initiating the actual rendering of the files; 474′ is the actual rendering process once it is initialized; and 476′ is the timeline of the final movie that needs to be output. It retrieves data 46′ (audio, video, pictures, and raw data) from the storage system 12′; 444′ is a project file that describes how to combine and render the files 46′; 478′ is the raw data stream of the single file that has been created; and 480′ is the output process for the rendering engine. Depending on the format of the output file, it has the ability to take as an input 472′, an encoder project file. This project file contains all the parameters needed to produce a streaming media file and is an optional input. 480′ also has the ability to produce a file in a non-streaming format. 482′ is the final product file that is stored on 48A′ or sent through 48C′ and 48B′.
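
By way of illustration, the job control, render, and output stages of FIG. 39 might be sketched as follows, under the simplifying assumption that rendering reduces to concatenating clip data in timeline order; the actual engine composites transitions and effects and encodes to streaming or non-streaming formats.

    from dataclasses import dataclass

    # Illustrative FIG. 39 pipeline: job control 470' initializes a render
    # 474' that walks the timeline 476', pulls raw files 46' from storage
    # 12', and hands the combined stream 478' to the output process 480'.
    @dataclass
    class Job:
        timeline: list[str]               # ordered media IDs from the command set 42"
        output_format: str = "streaming"  # 480' may also emit non-streaming formats

    STORAGE = {"UFN-A1B2": b"<video1>", "UFN-C3D4": b"<video2>"}  # stand-in for 12'

    def render(job: Job) -> bytes:
        raw = b"".join(STORAGE[mid] for mid in job.timeline)  # 46' -> 478'
        return raw                                            # 482': final file

    single_file = render(Job(timeline=["UFN-A1B2", "UFN-C3D4"]))
    print(len(single_file))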

Although an exemplary embodiment of the system, method, and computer readable medium of the present invention has been illustrated in the accompanying drawings and described in the foregoing detailed description, it will be understood that the invention is not limited to the embodiments disclosed, but is capable of numerous rearrangements, modifications, and substitutions without departing from the spirit or scope of the invention as set forth and defined by the following claims. For example, the capabilities of the systems 10 and 30 can be performed by one or more of the modules or components described herein or in a distributed architecture. For example, all or part of the functionality performed by the individual modules may be performed by one or more of these modules. Further, the functionality described herein may be performed at various times and in relation to various events, internal or external to the modules or components. Also, the information sent between various modules can be sent between the modules via at least one of: a data network, the Internet, a voice network, an Internet Protocol network, a wireless device, a wired device, and/or via a plurality of protocols. Also, the messages sent or received by any of the modules may be sent or received directly and/or via one or more of the other modules.

1. A method, comprising: creating a copy of a still image by a first module, the copy of the still image comprising reduced dimensions of the still image; creating a new still image from a selected area of the still image; automatically ordering the new still image and the copy of the still image; creating an audio file by at least one of a second module and a third module; creating a timeline, by the first module, related to the ordered images and the created audio file; and rendering the timeline into a video clip by a fourth module, wherein the timeline includes: a length of the audio file, a length of the video clip, the ordered images, a display time of each of the ordered images, transition identifiers, and transition lengths.
2. The method of claim 1 comprising storing at least one of: metadata related to the still image and the copy of the still image, metadata related to the audio file, the timeline, or information comprising the creating of the audio file.
3. The method of claim 2, wherein the metadata is stored in a command.
4. The method of claim 2, wherein the metadata stores at least one of: a height; a width; a size; compression data; and a file format.
5. The method of claim 1, wherein the copy of the still image is used for display purposes.
6. The method of claim 1, wherein the newly created still image is displayed as an image with reduced dimensions.
7. The method of claim 1 comprising panning and zooming at least one of: a portion of or the copy of the still image, a portion of or the newly created still image, a portion of or the stored new still image, or a portion of or the stored copy of the still image, wherein the panning and the zooming is a set of coordinates related to the image.
8. The method of claim 7, wherein the set of coordinates is stored in the timeline.
9. The method of claim 2 comprising transitioning from one to another one of the ordered images, and further comprising storing a transition identifier and a transition length.
10. The method of claim 2 comprising retrieving at least one of: the metadata related to the audio file, the metadata related to the still image and the copy of the still image, the timeline, or the information comprising the creating of the audio file, by the first module.
11. The method of claim 1 comprising optionally previewing the timeline via a fifth module.
12. A system, comprising: a device configured to create an audio file; and a web server configured to create: a copy of a still image; a new still image from a selected area of the still image, wherein the new still image and the copy of the still image are automatically ordered; and a timeline related to the ordered images and the created audio file, wherein the timeline includes a length of the audio file.
13. The system of claim 12, wherein the copy of the still image is created with reduced dimensions.
14. The system of claim 12 comprising memory adapted to store at least one of: the new image, the copy of the image, the new image provided with the unified file name, or the copy of the image provided with the unified file name, wherein the memory is coupled to the web server.
15. The system of claim 14 wherein the device is coupled to the web server, and wherein the device is adapted to order at least one of: the received images, the copy of the images, the new images, the stored images, or the ordered images.
16. The system of claim 14 comprising at least one of: a text-to-speech module; and an integrated voice response module; wherein at least one of the modules is adapted to create an audio file; and wherein at least one of the modules is coupled to at least one of the web server and the memory.
17. The system of claim 16, wherein the web server is adapted to create a timeline related to the ordered images and the created audio file, wherein the timeline and the audio file are stored in the memory.
18. The system of claim 17 comprising a rendering server adapted to render the timeline into the video clip.
19. A non-transitory computer readable medium, comprising instructions for: creating a copy of a still image by the first module, the copy of the still image comprising reduced dimensions of the still image; creating a new still image from a selected area of the still image; automatically ordering the new still image and the copy of the still image; creating an audio file by at least one of a second module and a third module; creating a timeline, by the first module, related to the ordered images and the created audio; and rendering the timeline into a video clip by a fourth module, wherein the timeline includes a length of the audio file, an identifier of the audio file created by the second module or the third module, or a volume envelope associated with the audio file.
20. The non-transitory computer readable medium of claim 19 comprising instructions for: storing at least one of: metadata related to the still image and the copy of the still image, metadata related to the audio file, the timeline, and information comprising the creating of the audio file; and transitioning from one to another one of the ordered images, and further comprising storing a transition identifier and a transition length.
 20. The non-transitory computer readable medium of claim19 comprising instructions for: storing at least one of: metadatarelated to the still image and the copy of the still image, metadatarelated to the audio file, the timeline, and information comprising thecreating of the audio file; and transitioning from one to another one ofthe ordered images, and further comprising storing a transitionidentifier and a transition length.